AP® STATISTICS 
2010 SCORING GUIDELINES 


Question 6 


Intent of Question 





The primary goals of this investigative task were to assess students’ ability to (1) produce and comment on 
a graphical display; (2) calculate a test statistic based on rank data; (3) use simulation results to draw an 
appropriate conclusion. 





Solution 
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Part (b): 


In all three regions (Gulf Coast, Florida, Lower Atlantic) the hurricane damage amounts tend to 
decrease as distance from the coast increases. For almost all given distances from the coast, the 
Florida region has the largest damage amounts. Also, for any given distance, the Gulf Coast and 
Lower Atlantic regions have similar damage amounts but with the Lower Atlantic damage amounts 
generally smaller. 


Part (c): 


For the “10 to 20 miles” distance category: The Florida region has the most damage (3.0 million dollars 
per acre) and so has rank 1. The region with the second-most damage is the Gulf Coast (1.7 million 
dollars), obtaining rank 2. The Lower Atlantic region has the least damage (0.3 million dollars) and so 
has rank 3. The last columns of the table should be filled in as follows: 
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Question 6 (continued) 





























10 to 20 miles Average Rank 
Gulf Coast 2 2.0 
Florida 1 1.2 
Lower Atlantic 3 2.8 
2+2+3+1+2 
The average ranks are computed St =2.0 for the five Gulf Coast damage ranks, 
a =1.2 for the five Florida damage ranks and a =2.8 for the five Lower Atlantic 


damage ranks. 
Part (d): 


The calculated value of the test statistic O is 
Q =5|(2.0-2)° $229) +(2.8-2)'| =5[0+0.64+0.64]=6.4. 


Part (e): 


A O value of 6.4 or larger occurred in 





= = 0.039 (or 3.9 percent) of the 1,000 repetitions. All 1,000 


repetitions of the simulation assumed there was no difference in the distribution of damage amounts 
among the three regions. This is a fairly small (approximate) p-value (less than 0.05), indicating that a 
test statistic as large or larger than the observed test statistic of Q =6.4 would be fairly unlikely to 


occur by chance alone if there really was no difference among the regions for each distance category. 
The sample data therefore provide reasonably strong evidence that there is a difference in the 
distributions of hurricane damage amounts among these three regions. 


Scoring 


This question is scored in four sections. Section 1 consists of part (a); section 2 consists of part (b); section 
3 consists of parts (c) and (d); section 4 consists of part (e). Each of the four sections is scored as essentially 
correct (E), partially correct (P) or incorrect (I). 


Section 1 is scored as follows: 


Essentially correct (E) if the response includes a well-labeled, statistically valid graph that allows for 
comparing damage amounts both across regions AND at varying distances from the coast. 


Partially correct (P) if the response includes a well-labeled, statistically valid graph that allows for 
comparing damage amounts across regions or at varying distances from the coast but not both OR 
if it includes a statistically valid graph that allows for both comparisons but lacks labels for either the 
regions or the distances (or both). 


Incorrect (I) if the graph does not allow for either comparison OR if it is poorly labeled and allows for 
only one comparison. 
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Question 6 (continued) 
Section 2 is scored as follows: 


Essentially correct (E) if the response includes BOTH of the following: 
e Mention of a difference — a contrast among the three regions across distance categories (e.g., 
“Florida has greater damage than the other two regions at most distances”) 


e Mention of a similarity — a statement that in all regions hurricane damage decreases with 
greater distance from the coast 


Partially correct (P) if the response includes a valid statement about only a difference or about only a 
similarity. 


Incorrect (I) if the response has neither a valid statement about the difference nor a valid statement 
about the similarity. 


Note: A statement that “Florida has more damage” is not sufficient to describe a difference if “at most 
distances” is not specified. Phrases that imply comparisons across distances such as “generally” or “for 
the most part” are acceptable for the “at most distances” specification. 


Section 3 is scored as follows: 


Essentially correct (E) if the response includes the correct ranks in part (c) AND the correct test 
statistic calculation in part (d). 


Partially correct (P) if the response includes either of the following: 
e The correct ranks in part (c) but not the correct test statistic calculation in part (d) 


e Incorrect ranks in part (c) but the test statistic calculated correctly in part (d), using those 
incorrect ranks 


Incorrect (I) otherwise. 


Notes 
e Calculation of the average ranks need not be shown to receive credit for correctly calculating QO. 


e A miscalculation of O results in no credit for part (d), but the severity of the miscalculation may be 
considered later in holistic scoring. 


Section 4 is scored as follows: 
Essentially correct (E) if the response includes BOTH of the following components: 


e Avalid quantification of the tail probability of the test statistic O based on the simulated 
O values 


e Aconclusion, in context, consistent with the rarity (or likelihood) of the calculated test statistic 
O relative to the simulated O values 


Partially correct (P) if the response includes only one of these two components. 


Incorrect (I) if the response does not include either of these two components. 
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Question 6 (continued) 


Each essentially correct (E) section counts as 1 point. Each partially correct (P) section counts as % point. 


4 Complete Response 

3 Substantial Response 
2 Developing Response 
1 Minimal Response 


If a response is between two scores (for example, 2% points), use a holistic approach to determine whether 
to score up or down, depending on the overall strength of the response and communication. 


© 2010 The College Board. 
Visit the College Board on the Web: www.collegeboard.com. 









STATISTICS 6A 
SECTION It 
Part B 
Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section II score—25 


Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


6. Hurricane damage amounts, in millions of dollars per acre, were estimated from i insurance records for major 
hurricanes for the past three decades. A stratifie ve J f 





e southeastern Uni e 
yast (Alabama, Louisiana, Mississippi), Florida, and Lower Atlantic (Georgia, South 
Carolina, North Carolina). Damage amounts in millions.of dollars per acre, adjusted for-inflation, are shown in 


the table below. 


HURRICANE DAMAGE AMOUNTS IN MILLIONS OF 
DOLLARS PER ACKE 










eee from ene 


| 2to 5 miles | to 5 miles 5 to 10 miles | to 10 miles ; 10 to 10 to 20 mits | miles 

rere por [om on fw |e 

ne ee ce 

| Mee [me foe] ee |e | | 
Atlantic _ 

ketch a graphical display that compares the hurricane damage amounts per acre for the three different 


coastal regions (Gulf Coast, Florida, and Lower Atlantic) and that also shows how the damage amounts vary 


with distance from the coast. 
4 





‘ 





ig (WA | 
fo sia 1" GO ON TO THE NEXT PAGE. 


Distance Exm Coast 


© 2010 The College Board. 
Visit the College Board on the Web: www.collegeboard.com. 


. (b) Describe differences and WS inn, in the hurricane damage amounts among the three regions. 
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Because the distributions of hurricane damage amounts are often skewed, statisticians frequently use rank values 
to analyze such data. 


(c) In the table below, the hurricane damage amounts have been replaced by the ranks 1, 2, or 3. For each of the 
distance categories, the highest damage amount is assigned a rank of 1 and the lowest damage amount is ~ 
assigned a rank of 3. Determine the missing ranks for the 10-to-20-miles distance category and calculate the 
average rank for each of the three regions. Place the vans in the table below. 


ASSIGNED RANKS WITHIN DISTANCE CATEGORIES 


oes from. ea | 





GO ON TO THE NEXT PAGE. 
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(d) Consider testing the following hypotheses. 


H_: There is no difference in the distributions of hurricane damage amounts among the 


three regions. 
H.: There is a difference in the distributions of hurricane damage amounts among the three regions. 


If there is no difference in the distribution of hurricane damage amounts among the three regions (Gulf 
Coast, Florida, and Lower Atlantic), the expected value of the average rank for each of the three regions is 2. 
Therefore, the following test statistic can be used to evaluate the hypotheses above: 


o=5|(R,-2) +(R,-2) +(R,-2)' | 


where R, is the average rank over the five distance categories for the Gulf Coast (and R, and R ', are 
similarly defined for the Florida and Lower Atlantic coastal regions). 


Calculate the value of the test statistic Q using the ih ranks you obtained in a (c). 


sf (2-2) (be) 1 (26 2)" 


spo + 64 + 644 
gee" 


(e) One thousand simulated values of this test statistic, Q, were calculated, assuming no difference in the. 
distributions of hurricane damage amounts among the three coastal regions. The results are shown in the 
table below. These data are also shown in the frequency plot where the heights of the lines represent the 
frequency of occurrence of simulated values of Q. 


Frequency Table for Simulated Values of Q 


Sliced sew 
Frequenc Percent 

| oo | 648) | 48 4.80 | 480 | 
| o4 {262 1 310 _1_26.20_ 131.00 
2 | _168 | 478 | 1680_| 47.80 
| 153 | 631 15.30 | 63.10 
| 186 | ~=—817_~—s| «18.60 | 81.70 | 
| 36 | 59 | 876 | 5.90 | 87.60 | 
| 48 | 33 | 909 =| 3.30 | 90.90 
| 5.2 | 52 | 961 | 5.20 | 96.10 | 
| 64 | 16 =| 977 sts 160 | 97.70 
| 76 [| 15 | 992 | 150 | 99.20 | 
| 84] 6 | 99 | 060 | 99.80 | 
| 100 {| 2 | 1000 | 0.20 | 100.00 | 



















GO ON TO THE NEXT PAGE. 
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i FREQUENCY PLOT 
262 
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Simulated Values of Q 


Use these simulated values and the test statistic you calculated in part (d) to determine if the observed data 
provide evidence of a significant difference in the distributions of hurricane damage amounts among the three 
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STATISTICS 
SECTION I 
‘Part B.- 
- Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section II score—25. 


Directions: Show:all your work. Indicate clearly the methods you use, because you will be scored on the 
correctness of your,methods as well as. on the accuracy and completeness of your results and explanations. ao 





_6. Hurricane dariage amounts, in millions of dollars per acré, were estimated from insurance records for major 
_ hurricanes for the past three‘decades. A stratified random sample of five locations (based on categories of 
distance from the coast) was selected from each of three coastal regions in the southeastern, United States. The 
threé regions were Gulf,Coast (Alabama, Louisiana, Mississippi), Florida, and Lower Atlantic (Georgia, South 
Carolina, North Carolina). Damage amou of d ju i 
the table below. . 








amounts in millions of dollars per acre, adjusted for inflation, are shown in 
HURRICANE DAMAGE AMOUNTS IN MILLIONS OF 
a “DOLLARS PER ACRE 


=. Distance from Coast | 
| <1 mile | 1 to 2 miles 5 to 10 miles | 10 to 20 miles 







(a) Sketch a graphical display that compares the hurricane damage amounts per acte for the three different 
" coastal regions (Gulf Coast, Florida, and Lower Atlantic) and that also shows how the damage amounts vary © 
with diganpe from Ty foast Homicane Demagt oe 
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6 BQ 
0) Describe differences.and similarities in the hurricane damage amounts among the three regions. - 

"AM Yarte coastal regions’ damagu amounts are - 
right skewed , Which means that the closer Cun, Of these 
VEGTONS arts Were to tne coast. He, more damage in millions: 

- pctuwtd. Also, ne Florida ' had significanty yore damage 
:  Closeto the Coast line 'B mile or fess, ‘However, Me Gvif Coast 

hadthe most damage amounts . 5-10 miles fromthe 

Coast. Overtul the lower aHanhe had The (east hormone — 


ae cumounts ‘ial He barcas. 


Because the distributions of hurricane damage amounts a are often cence ‘statisticians frequently use rank values 
to analyze a data: a a 


(c) In the table below, the hurricane damage amounts have been. replaced by the ranks 1, 2, or 3. For each of the 
distance categories, the highest damage amount is assigned a rank of 1 and the lowest damage amount is 
"assigned a rank of 3. Determine the missing | ranks for the 10-to-20-miles distance category and calculate the 

average rank for each of the three een: Place the values i in the table below. 


_ ASSIGNED RANKS. WITHIN DISTANCE CATEGORIES 7 
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(d) Consider testing the following hypotheses. 


‘H; There i is no Renee in n the distributions of hurricane damage amounts among ‘the 
three’ regions. ; 
Hy There i is a difference in the distributions of hurricane damage amounts among the three regions. 


. If there is no. difference i in the distribution of hurricane damage amounts among the fies regions 1s (Gulf 
- Coast, Florida, and Lower Atlantic), the expected value of the average rank for each of the’ three regions is 2. 
Therefore, the following test statistic can be used to evaluate the hypotheses or, 


gr = 3{(Re- aN -(R, -af +R -2)'] 


’ where R, “is the average ar over the five distance: categories for the Guif Coast (and R, and R, are 
similarly defined for the Florida and Ldwer Atlantic coastal regions). 


“Calculate the value of the test statistic Ge using ete average rank you obtained 4 in pan . “a as 


Q.=2 - Oe “slr ?. cil ea 2] 





(e) One th thousand. simulated values, thi statistic, O, were Bicainted assuming no difference i in the 
distributions Of hurricane dainage amounts ‘Airong the three coastal régions. The results are shown in the 
table below. These data are.also shown inthe frequency. Plot where the Pores of the lines ceeeee the 
Pe oreo - occurrence of amulets values of Q i : 









» Freguchey Table for - Siinulated Values of iQ” 


Frequency ‘Cumulative Percent Cumulative 
Frequene Percent_ : 
00 [48 | 48 __4.80_|_ 4.80 
[oa | 262 3104 26.29 | 31.00 
a 
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4. 5 8 9 10 
Simulated Values of Q | 
Use these simulated values and the test statistic you calculated in part (d) to determine if the observed data . 


provide evidence of a significant difference i in the distributions of hurricane a amounts among the three 
coastal regions. Explain. 


Q=64 according to muy calculations. | 
Ontne ov raph anc chawt, shows that this Wome 
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STATISTICS « # 6 C | 
SECTION I 
Part B 
_ Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section I score—25 


Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. — 


6. Hurricane damage amounts, in millions of dollars per acre, were estimated from insurance records for major 
hurricanes for the past three decades. A stratified random sample of five locations (based on categories of 
distance from the coast) was selected from each of three coastal regions in the southeastern United States. The 
three regions were Gulf Coast (Alabama, Louisiana, Mississippi), Florida, and Lower Atlantic (Georgia, South 


Carolina, North Carolina). Dantes amounts in millions of dollars per acre, adjusted for inflation, are shown in 
the table below. 


HURRICANE DAMAGE AMOUNTS IN MILLIONS OF 
DOLLARS PER ACRE 














Distance*from Coast i 
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(a) Sketch a graphical display that compares the hurricane damage amounts per acre for the three different 
coastal regions (Gulf Coast, Florida, and Lower Atlantic) and that also shows how the damage amounts vary 
with distance from the coast. 
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ae 
(b) Describe differences and similarities in the hurricane damage amounts among the three regions. | 
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Because the distributions of hurricane damage amounts are often skewed, statisticians frequently use rank values 
to analyze such data. 


(c) In the table below, the hurricane damage amounts have been replaced by the ranks 1, 2, or 3. For each of the 
distance categories, the highest damage amount is assigned a rank of 1 and the lowest damage amountis 
assigned. a rank of 3. Determine the missing ranks for the 10-to-20-miles distance category and calculate the 
average rank for each of the three regions. Place the values in the table below. 


ASSIGNED RANKS WITHIN DISTANCE CATEGORIES 
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(d) Consider testing the following hypotheses. 


€) 


H,;: There is no difference in the distributions of hurricane damage amounts among the 


three regions. 


oz 3|(R, ~2) +(R, -2) +(R, - 2)'| 


Calculate the value of the test statistic Q using the average ranks you obtained in part (c). 


Qe [(a-2Y" srry e (u4t-7Y | 


Q2e6.4 


table below. These data 
frequency of occurrence of simulated values of Q. 


Frequency Table for Simulated Values of Q 


1h aces ——] ole 
Frequenc Percent 
foo; 648) | CS 48] 4.80 | 480 | 
[16 [ 153 | 631 | 15.30 | 63.10 | 
| 48 | 33 | 909 | 3.30 | 90.90 | 
[64 [ 16 | 977 | 160 | 97.70 | 
| 7.6 | 15 | «992 | 150 | 99.20 | 
[ 34] 6 | 998 | 060 | 99.80 | 


























6 Cj 


H a There is a difference in the distributions of hurricane damage amounts among the three regions. 


If there is no difference in the distribution of hurricane damage amounts among the three regions (Gulf 
Coast, Florida, and Lower Atlantic), the expected value of the average rank for each of the three regions is 2. 
Therefore, the following test statistic can be used to evaluate the hypotheses above: 


where R, is the average rank over the five distance categories for the Gulf Coast (and R, and R, are 
similarly defined for the Florida and Lower Atlantic coastal regions). 


One thousand simulated values of this test statistic, Q, were calculated, assuming no difference in the © 
distributions of hurricane damage amounts among the three coastal regions. The results are shown in the 
also shown in the frequency plot where the heights of the lines represent the © 


GO ON TO THE NEXT PAGE. 
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FREQUENCY PLOT 
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~~ Simulated Values of Q | 


Use ticse simulated values and the test statistic you calculated in part (d) to determine if the observed data 
provide evidence of a significant difference in the distributions of hurricane damage amounts among the three 


coastal regions. Explain. 
This Goes wot prove. oadonce enre Geto: s 
a siguvR cant ditterenm iA tt dlittriloutens of hurricane 
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Question 6 
Overview 


The primary goals of this investigative task were to assess students’ ability to (1) produce and comment on 
a graphical display; (2) calculate a test statistic based on rank data; (3) use simulation results to draw an 
appropriate conclusion. 


Sample: 6A 
Score: 4 


In part (a) the student draws a well-labeled bar graph in which the bars are grouped by the distance 
categories, and a key is provided for ease in identifying the three regions. Section 1, consisting of part (a), 
was scored as essentially correct. In part (b) Florida's having “the most hurricane damage in 4 of the 5 strata” 
is correctly identified as a significant difference. Although it was not necessary to describe the categorical 
distances as strata (as provided in the stem of the question), this is a nice detail. Also, the fact that damage 
decreases as the distance from the coast increases is correctly identified as a major similarity among the 
regions. Section 2, consisting of part (b), was scored as essentially correct. In parts (c) and (d) the ranks, 
average ranks and test statistic, O, are correctly calculated, so section 3, consisting of parts (c) and (d), was 
scored as essentially correct. In part (e) a simulated p-value of 0.039 is correctly identified for the test statistic 
value of O = 6.4 and is used to provide an appropriate conclusion in context. Section 4, consisting of part (e), 
was scored as essentially correct. With all four sections essentially correct, the response earned a score of 4. 


Sample: 6B 
Score: 3 


In part (a) the student draws a well-labeled bar graph, grouping the bars by distance, and a key is provided for 
ease in identifying the three regions. Section 1, consisting of part (a), was scored as essentially correct. In part 
(b), although the term “right skewed” is troublesome because of the categorical nature of the distances, the 
major similarity among the regions is correctly identified by indicating that damages increase closer to each 
of the coasts. The differences among the three regions are not completely specified for all five distance 
categories, so section 2, consisting of part (b), was scored as partially correct. In parts (c) and (d) the ranks, 
average ranks and test statistic, O, were correctly calculated, and section 3, consisting of parts (c) and (d), 
was scored as essentially correct. In part (e) no p-value for the test statistic is explicitly identified, but instead 
0.016, the simulated probability of getting O = 6.4, is used to provide a conclusion in context. Section 4, 
consisting of part (e), was scored as partially correct. With two sections essentially correct and two sections 
partially correct, the response earned a score of 3. 


Sample: 6C 
Score: 2 


In part (a) the graph is a well-labeled plot of the categorical distances on the vertical axis and damage 
amounts on the horizontal axis, and a key is provided for ease in identifying the three regions. Section 1, 
consisting of part (a), was scored as essentially correct. In part (b), although the phrase “similar non linear 
shapes” is correct for the graphs, the response does not convey the nature of relationship (decreasing 
damages as distance from the coast increases), and none of the statements provides a complete response for 
differences among the regions for all distances. Section 2, consisting of part (b), was scored as incorrect. In 
parts (c) and (d) the ranks, average ranks and test statistic, O, were correctly calculated. Section 3, consisting 
of parts (c) and (d), was scored as essentially correct. In part (e) the simulated O values are described, but an 
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Question 6 (continued) 


approximate p-value is not identified, and the test statistic O = 6.4 is determined to be “not that uncommon.” 
Hence, section 4, consisting of part (e), was scored as incorrect. With two sections essentially correct and two 
sections incorrect, the response earned a score of 2. 
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